Making Convolutions Resilient Via Algorithm-Based Error Detection Techniques
نویسندگان
چکیده
Convolutional Neural Networks (CNNs) are being increasingly used in safety-critical and high-performance computing systems. As such systems require high levels of resilience to errors, CNNs must execute correctly the presence hardware faults. Full duplication provides needed assurance but incurs a prohibitive 100 percent overhead. In this article, we focus on algorithmically verifying convolutions, most resource-demanding operations CNNs. We use checksums verify convolutions. identify feasibility performance related challenges that arise detecting errors convolutions optimized CNN inference deployment platforms (e.g., TensorFlow or TensorRT GPUs) fuse multiple network layers reduced-precision operations, demonstrate how overcome them. propose evaluate variations algorithm-based error detection (ABED) techniques offer implementation complexity, runtime overhead, coverage trade-offs. Results show ABED can detect all transient might otherwise corrupt output with low overheads (6-23 percent). Only about 1.4 total computations not protected by ABED, which be duplicated for full protection. compute-intensive duplicating rest at least 1.6× throughput compared duplication.
منابع مشابه
An Error-Resilient Blocksorting Compression Algorithm
A Burrows-Wheeler Compressor breaks input into blocks, quickly makes each more compressible, and compresses the modified block with a simple arithmetic or Huffman compressor. We propose an error-resilient Inverse Burrows-Wheeler Compressor. It uses a small amount of overhead alongside output from an ordinary BWT and MTF. It is also size-competitive with BZIP, a popular Burrows-Wheeler compressor.
متن کاملError Resilient Video Coding Techniques Using Spare Pictures
In error prone environments not all the picture data can be received correctly during video transmission. Error propagation introduced by employing predictive coding may degrade the sequence quality severely. A novel method, called spare pictures, is proposed to indicate the similarity between a reference picture and other pictures. With the help of the signaled spare pictures information, rece...
متن کاملComplementarity of Error Detection Techniques
We study explicit techniques for detection of safety errors, e.g., depth-first search, directed search, random walk, and bitstate hashing. We argue that it is not important to find the best technique, but to find a set of complementary techniques. To this end, we choose nine diverse error detection techniques and perform experiments over a large set of models. We compare speed of techniques, le...
متن کاملAn Adaptive-Parity Error-Resilient LZ'77 Compression Algorithm
The paper proposes an improved error-resilient Lempel-Ziv'77 (LZ'77) algorithm employing an adaptive amount of parity bits for error protection. It is a modified version of error resilient algorithm LZRS'77, proposed recently, which uses a constant amount of parity over all of the encoded blocks of data. The constant amount of parity is bounded by the lowest-redundancy part of the encoded strin...
متن کاملError-resilient Video Compression via Multiple State Streams
Video compression enables a number of applications by reducing the required bit rate needed to represent a video sequence, however the compressed video is much more susceptible to errors, e.g. bit errors or packet loss. Conventional video compression standards employ an architecture which we refer to as single-state systems since they have a prediction loop with a single state (e.g. the previou...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Transactions on Dependable and Secure Computing
سال: 2022
ISSN: ['1941-0018', '1545-5971', '2160-9209']
DOI: https://doi.org/10.1109/tdsc.2021.3063083